k-Approximate Quasiperiodicity Under Hamming and Edit Distance

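Abstract

Quasiperiodicity in strings was introduced almost 30 years ago as an extension of string periodicity. The basic notions of quasiperiodicity are cover and seed. A cover of a text T is a string whose occurrences in T cover all positions of T. A seed of text T is a cover of a superstring of T. In various applications exact quasiperiodicity is still not sufficient due to the presence of errors. We consider approximate notions of quasiperiodicity, for which we allow approximate occurrences in T with a small Hamming, Levenshtein or weighted edit distance. In previous work Sim et al. (J Korea Inf Sci Soc 29(1):16–21, 2002) and Christodoulakis et al. (J Autom Lang Comb 10(5/6), 609–626, 2005) showed that computing approximate covers and seeds, respectively, under weighted edit distance is NP-hard. They, therefore, considered restricted approximate covers and seeds, which need to be factors of the original string T, and presented polynomial-time algorithms for computing them. Further algorithms, considering approximate occurrences with Hamming distance bounded by k, were given in several contributions by Guth et al. They also studied relaxed approximate quasiperiods. We present more efficient algorithms for computing restricted approximate covers and seeds. In particular, we improve upon the complexities of many of the aforementioned algorithms. Our solutions are especially efficient if the number (or total cost) of allowed errors is small. We also show conditional lower bounds for computing restricted approximate covers and prove NP-hardness of computing non-restricted approximate covers and seeds under the Hamming distance.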
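To make the cover definition concrete, the following is a minimal brute-force sketch of a check for a k-approximate cover under Hamming distance. It is not one of the algorithms from the paper; the function names `hamming` and `is_k_approx_cover` are hypothetical, and the check runs in time proportional to |T| times the cover length, far from the complexities the abstract refers to.

```python
def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs equal-length strings")
    return sum(x != y for x, y in zip(a, b))


def is_k_approx_cover(cover: str, text: str, k: int) -> bool:
    """True if occurrences of `cover` with at most k mismatches cover all of `text`."""
    m, n = len(cover), len(text)
    if m == 0 or m > n:
        return False
    covered_up_to = 0  # text[0:covered_up_to] is covered so far
    for i in range(n - m + 1):
        if i > covered_up_to:
            # Position `covered_up_to` cannot be covered by any later occurrence.
            return False
        if hamming(cover, text[i:i + m]) <= k:
            covered_up_to = i + m
    return covered_up_to == n


# "aba" covers "abaababaaba" exactly (k = 0).
print(is_k_approx_cover("aba", "abaababaaba", 0))
# "aba" covers "abaabbaba" only if one mismatch per occurrence is allowed.
print(is_k_approx_cover("aba", "abaabbaba", 0), is_k_approx_cover("aba", "abaabbaba", 1))
```

Under Hamming distance every approximate occurrence has the same length as the candidate cover, which is why a single left-to-right scan over start positions suffices here. Under Levenshtein or weighted edit distance, occurrences may differ in length, so this sketch does not carry over directly.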

Similar resources

Approximate String Searching under Weighted Edit Distance

Let p be a string of length m and t be a string of length n. The approximate string searching problem is to find all approximate matches of p in t having weighted edit distance at most k from p. We present a new method that preprocesses the pattern into a DFA which scans t online in linear time, thereby recognizing all positions in t where an approximate match ends. We show how to reduce the ...

Restricted Transposition Invariant Approximate String Matching Under Edit Distance

Let A and B be strings with lengths m and n, respectively, over a finite integer alphabet. Two classic string matching problems are computing the edit distance between A and B, and searching for approximate occurrences of A inside B. We consider the classic Levenshtein distance, but the discussion is also applicable to indel distance. A relatively new variant [8] of string matching, motivated in...

MACFP: Maximal Approximate Consecutive Frequent Pattern Mining under Edit Distance

Consecutive pattern mining, which aims at finding sequential patterns that are substrings, is a special case of frequent pattern mining and has played a crucial role in many real-world applications, especially in biological sequence analysis, time series analysis, and network log mining. Approximations, including insertions, deletions, and substitutions, between strings are widely used in biological seq...

Journal

Journal title: Algorithmica

Year: 2021

ISSN: 1432-0541, 0178-4617

DOI: https://doi.org/10.1007/s00453-021-00842-7